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Method for classifying information in portable data proces- 
sors AND PORTABLE DATA PROCESSOR 



FIELD 

[0001] The invention relates to a method for classifying information 
5 in a portable data processor and to a portable data processor. 

BACKGROUND 

[0002] The most basic form for classifying information in a data 
processor is carried out manually. For example, the user stores files, e-mail 
messages and web site links in a specific database, file structure etc. of a pro- 

10 gram in each information processing program. Such classification of informa- 
tion is one-dimensional and program-specific. Particularly in portable data 
processors, in which the user interface is generally more concise than for in- 
stance in personal computers, a need arises to improve the classification of 
information in order to alleviate the subsequent use thereof. Also the process- 

15 ing capacity that is lower in portable data processors than in personal com- 
puters and the limitations regarding power consumption set specific require- 
ments to the efficiency and user-friendliness of portable data processor soft- 
ware. 

[0003] Current e-mail programs include automatic filters that allow 
20 incoming e-mail messages to be automatically directed into a folder without 
requiring the user to take any real-time measures. The user must have set pa- 
rameters for the filtering in advance. This may create problems, since the user 
does not know in advance what kind of information can be sent to him/her. Fur- 
thermore, the classification is one-dimensional, although in reality the informa- 
25 tion characteristically belongs to various categories. The subsequent retrieval 
of the information classified in such a manner is difficult; generally key word 
searches are employed that provide several irrelevant results. Keyword re- 
trieval is also a fairly cumbersome procedure, as the search is carried out from 
a particular level in the directory tree, and the search is based on a few fixed 
30 steady fields (such as dates) and word retrievals. 

BRIEF DESCRIPTION 

[0004] It is an object of the invention to provide an improved method 
for classifying information in a portable data processor and an improved port- 
able data processor. 
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[0005] According to an aspect of the invention, there is provided a 
method for classifying information in a portable data processor comprising: 
processing information based on commands obtained from a user interface in 
the portable data processor; associating information multi-dimensionally into at 
5 least two different categories according to information type and at least one 
other criterion; presenting the associations in the user interface and carrying 
out processing related to the associations based on the commands obtained 
from the user interface; and storing the associations for subsequent use. 

[0006] According to another aspect of the invention, there is pro- 

10 vided a portable data processor comprising a processing unit for processing 
information, a user interface connected to the processing unit for presenting 
the information to a user of the portable data processor and for providing 
commands in order to process information, and a memory connected to the 
processing unit for storing information, and the processing unit is configured to 

15 associate information multi-dimensionally into at least two different categories 
according to information type and at least one other criterion, present the as- 
sociations in the user interface and carry out the processing related to the as- 
sociations based on the commands obtained from the user interface, and store 
the associations in the memory for subsequent use. 

20 [0007] The invention is based on the idea that classification is car- 

ried out when the user is processing information using a user interface, 
whereby the user also controls how the classification is carried out by means 
of the commands provided from the user interface. What is in fact concerned is 
a semi-automatic classification. In addition, the classification is multi- 

25 dimensional and corresponds better than a one-dimensional classification with 
the inherent properties of the information. 

[0008] The method and apparatus of the invention provide several 
advantages. The multi-dimensional properties of the classification enable to 
subsequently use the results thereof more efficiently. Already during the classi- 

30 fication the user may affect the results of the classification, whereby the sub- 
sequent usability thereof improves. The nature of the information is not rele- 
vant in the solution, and therefore the portable data processor comprises in an 
ideal case only one classification user interface, which functions in the same 
way in different programs, such as a web browser, file management and an e- 

35 mail program. It is easier for the user to learn the operation logic of one classi- 
fication program. Furthermore, very different kind of information associated 



with the same concepts or matters, can efficiently be classified to the same 
category or categories. 

LIST OF DRAWINGS 

[0009] In the following, the invention is explained in greater detail by 
5 means of the preferred embodiments with reference to the accompanying 
drawings, in which 

[0010] Figure 1 is a simplified block diagram showing an example 
structure of a portable data processor, 

[0011] Figure 2 shows an e-mail message, and 
10 [0012] Figure 3 is a flow chart illustrating a method for classifying in- 

formation in a portable data processor. 

DESCRIPTION OF EMBODIMENTS 

[0013] With reference to Figure 1, an example of the structure of a 
portable data processor is described. The apparatus may be a portable device 

15 associated with Ubiquitous Computing, such as a subscriber terminal, of a ra- 
dio system, such as a mobile phone or a PDA device (Personal Digital Assis- 
tant) or another portable data processor. Different roles may also be combined 
in the apparatus, meaning that it may for instance be a combination of a sub- 
scriber terminal and a PDA device, an example of which is the Nokia® Com- 

20 municator™. 

[0014] In our example in Figure 1, the data processor is a sub- 
scriber terminal in a radio system. Since a subscriber terminal is used as an 
example, the apparatus comprises an antenna 100 and a radio transceiver 
102. The radio transceiver 102 is for example a prior art mobile transceiver that 

25 operates for instance in the GSM (Global System for Mobile Communications), 
GPRS (General Packet Radio Service) or UMTS (Universal Mobile Telecom- 
munications System) network. 

[0015] In addition, the apparatus comprises a processing unit 104 
that controls and monitors the operation of the apparatus and the different 

30 parts thereof. The processing unit 104 also includes application programs of 
the apparatus, for example for processing a radio signal, for processing infor- 
mation and for managing a user interface. At present, the processing unit 104 
is implemented in general as a processor including software, but different 
equipment implementations are also possible, such as a circuit composed of 

35 separate logic components or one or more application-specific integrated cir- 



cuits (ASIC). The number of processors may if required exceed one. Also a 
hybrid of these different implementations is possible. A person skilled in the art 
observes when selecting the implementation for instance the requirements set 
for the size and power consumption of the apparatus, the required processing 
5 power, the manufacturing costs and the production volume. 

[0016] Furthermore, the data processor comprises a user interface 
108 connected to the processing unit 104. The user interface 108 can be im- 
plemented in accordance with the prior art. The user interface may comprise 
for example a keyboard for controlling the data processor, feeding information 

10 and providing information processing commands and a display for showing 
information, and also a microphone and a loudspeaker for processing sound. A 
chargeable battery 106 generally operates as the power source. The appara- 
tus also comprises a memory 112 connected to the processing unit 104 for 
storing information. The memory 112 can be implemented as is known in the 

15 art for implementing a non-volatile memory, for instance as memory circuits or 
as a small hard disc. 

[0017] In order to implement the classification of information, the 
processing unit 104 is configured to associate information multi-dimensionally 
into at least two different categories according to information type, and at least 

20 one other criterion. The processing unit 104 is also configured to present the 
associations in the user interface 108 and to carry out the processing related to 
the associations based on the commands obtained from the user interface 
108. The processing related to the associations can be implemented for in- 
stance in such a manner that the user is offered three possibilities through the 

25 user interface 108; to accept an association, to reject an association, to 
change an association. The user then performs the choice in the user interface 
108 using the command provided, which is interpreted in the processing unit 
104, and then the desired processing related to the association is carried out. 
In an embodiment, the information is a file, and the processing unit 104 is con- 

30 figured to carry out the presentation when storing or closing a file. In an em- 
bodiment the information is a web site and the processing unit 104 is config- 
ured to carry out the presentation when exiting the web site, or when closing a 
browser used for browsing, or later when the data transmission connection of 
the portable data processor is offline. 

35 [0018] The processing unit 104 is configured in such a manner that 

it is capable of processing different types of information. In an ideal case, all 



information processed in the data processor can be classified using consistent 
processing that the processing unit 104 offers and a similar user interface 108. 
The information may be of the following types: a file, an e-mail message, a 
web site, a text message, a multi-media message, a calendar message, a task 
5 message, and another set of data presented using alphabetic and/or numeric 
characters, binary data (such as a sound file, an image file, a video file, an ex- 
ecutable program file). 

[0019] The criteria used in association naturally vary according to 
information type, and therefore listing all possible combinations becomes diffi- 

10 cult. The criteria may for example be the following ones: title of information (the 
title as an entity or individual words therein), contents of information, context 
information associated with information, location information associated with 
information, links associated with information, meta data of information, and a 
caller group division of a subscriber terminal in a radio system. 

15 [0020] The processing unit 104 can be configured in connection 

with the processing related to the associations also to carry out processing 
related to categories. The processing related to categories can be imple- 
mented for instance in such a manner that through the user interface 108 the 
user is offered different possibilities to function as regards the categories, for 

20 example the following possibilities: deleting a category, changing the proper- 
ties of a category, creating a new category and associating information to the 
created category. The matters associated with the categories shown in the 
user interface 108 and the commands related to the categories obtained from 
the user interface 108 thus enable the user to delete an existing category, cre- 

25 ate a new category and change the properties of existing categories. 

[0021] The processing unit 104 is configured to store the associa- 
tions in the memory 1 1 2 for subsequent use. 

[0022] Configuring the processing unit 104 forms structural entities, 
which can be implemented for instance as program modules, coded with a 

30 programming language, such as a C programming language, a C++ program- 
ming language, a machine language or an assembler that is stored as a run- 
time version into the memory in the processor and that is run by the processor. 
Instead of translatable programming languages, interpretable programming 
languages can naturally also be used, presuming that the use thereof meets 

35 the required processing rate and power. When implementing the processing 
unit 104 in ASIC form, the structural entities are ASIC blocks. 
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[0023] The processing unit 1 04 can be configured to perform proc- 
essing and association in parallel or in turn. What "processing" refers to in this 
context is processing information simultaneously as it is presented in the user 
interface 108, and the user interface 108 is used to provide commands for 
5 processing information. In an embodiment the information is a file, and the 
processing unit 104 is configured to carry out association when opening, stor- 
ing or closing the file. In another embodiment the information is a web site, and 
the processing unit 104 is configured to carry out association when browsing to 
the web site. Parallel computing refers to the fact that processing is carried out 

10 as a specific process or task and association as a specific process and task. 
Parallel processing requires a multi-tasking computer, which can be imple- 
mented using an operating system that allows multi-tasking. Such a solution is 
also possible, in which processing is carried out in turn, either so as to perform 
processing at first and association thereafter, or then so as to perform tasks in 

15 turn a particular time slice at a time. The latter way is one way to implement 
virtual parallelism. 

[0024] In an embodiment the processing unit 104 is configured to 
remind the user in the user interface 108 about the association stored in the 
memory 112. The association associated with the information may thus help 

20 the user to remember a necessary matter. A simple example is such where the 
processing unit 104 is configured to determine the present instant of time and 
to carry out a reminder concerning the association associated with the deter- 
mined instant of time stored in the memory 112. An association may for in- 
stance be associated with a particular date and/or time. The processing unit 

25 104 can be configured to determine the state of the subscriber terminal in the 
radio system (for example a profile currently used in a mobile phone, or infor- 
mation about whether a call is in progress), and to send a reminder in the user 
interface 108 if it fits into the determined state. For instance, during an ongoing 
call, the reminder is probably not performed. On the other hand, the reminder 

30 can be carried out in the user interface 112 concerning the association associ- 
ated with the determined state - stored in the memory 112. For example, a re- 
minder associated with a particular profile can be carried out; for example, 
when the profile is a "meeting" the reminder may concern an e-mail message 
about the meeting. 

35 [0025] Reminding based on a more complicated logic is also possi- 

ble. In an embodiment the processing unit 104 is configured to sense the op- 
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erational environment of the portable data processor and to perform the re- 
minder in the user interface 108 concerning the association associated with the 
sensed operational environment stored in the memory 112. Sensing the opera- 
tional environment of the data processor may take place in different ways. Fig- 
5 ure 1 describes a sensor 110 that senses the operational environment. The 
sensor 110 may for instance be a GPS receiver (Global Positioning System) or 
a receiver of another positioning system, which allows defining the location of 
the data processor. A separate sensor 110 is not necessarily required, since 
the subscriber terminal may be fairly accurately located in the current mobile 

10 systems on the basis of sent and/or received radio signals. The sensor 110 
may also be very simple, for instance an electrical or electromechanical sensor 
indicating that the data processor is connected to a desk-top charger or to a 
car charger, for example. 

[0026] In an embodiment the processing unit 104 is configured to 

15 determine the context information associated with the location of the sub- 
scriber terminal in the radio system and to carry out in the user interface 108 
the reminder concerning the association associated with the determined loca- 
tion stored in the memory 112. This can be implemented for instance so that 
the terminal comprises a Bluetooth™ transceiver or another device implement- 

20 ing local connectivity that allows forming an ad hoc network. An integrated cir- 
cuit according to the Bluetooth™ technology implements a radio connection 
having a coverage of a few hundred metres at the most, for instance at a fre- 
quency of 2.4 giga hertz. For example, when the terminal is close enough to a 
home communication centre communicating using Bluetooth™, it may be as- 

25 sumed that the location is home. Or, when one is close to a ticket machine, the 
user may be reminded about the fact that a ticket must be bought for the jour- 
ney taking place the day after tomorrow. 

[0027] Let us next take a closer look at the classification of a simple 
e-mail message described in Figure 2 in a portable data processor. The e-mail 

30 programme to be run in the processing unit 104 of the data processor has re- 
ceived an e-mail message 200. The user interface 108 connected to the proc- 
essing unit 104 enables to present said information to the user when the e-mail 
programme has received from the user interface 108 a command provided by 
the user to read the e-mail message. The e-mail message may already be 

35 stored in the memory 1 12 or it may be stored therein later. The processing unit 
104 is configured to associate the e-mail message multi-dimensionally into at 
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least two different categories according to information type and at least one 
other criterion. In an embodiment the processing unit 104 is configured to carry 
out association when opening the e-mail message for reading either by proc- 
essing in parallel or in turn as shown above. In addition, the processing unit 
5 104 is configured to present the associations in the user interface 108 and to 
carry out the processing related to the associations based on the commands 
obtained from the user interface 108, and to store the associations in the 
memory 112 for subsequent use. In an embodiment the processing unit 104 is 
configured to carry out the presentation when the e-mail message is being 

10 closed or when moving to the following e-mail message. If the user rejects all 
proposed associations, the e-mail message can be placed into an incoming e- 
mail message file, or it may be moved to the recycle bin. 

[0028] A classification algorithm to be run in the processing unit 104 
may search from the e-mail message, for instance, the following classification 

15 data: sender, date of arrival, priority. If the classification is based on searching 
key words from the text, then words like "summer holidays" and "proposal" can 
be found from the subject field in header data 202 of the e-mail message 200. 
Furthermore, character strings that resemble the general formats of dates can 
be retrieved from a text part 204 in the e-mail message 202. 

20 [0029] Then the retrieved classification data are compared with the 

classification criteria. Basically connecting two elements, like classification data 
(such as John Doe) and classification criteria (such as a sender), together pro- 
vides an association between the two. Thus, the classification proposal of the 
e-mail message 200 could for instance be the following: 

25 [0030] - type = e-mail message, 

[0031] - sender = John Doe, 
[0032] - received = 15 May 2002, 
[0033] - priority = normal, and 

[0034] - subject = "summer holidays", "proposal", "31 May 2002". 
30 [0035] If the user accepts the proposed classification, the user's 

portable data processor may remind the user about this message, for instance 
one day before said date 31 May 2002. 

[0036] The user may also add previously defined groupings to the 
classification. A grouping typically found in mobile phones in particular is a di- 
35 vision based on caller groups. The example message presented herein could 
allegedly belong to the group "work", since the message has arrived from the 



manager that belongs to the caller group "work". However, the user may him- 
self/herself control such a grouping, and therefore he/she may if desired add 
"home" as a grouping, so as to remember to agree upon his/her days off with 
his/her family while being at home. This creates the function already described 
5 above at the search function side: if the mobile phone is capable of sensing 
the operational environment thereof, then it is also capable of reminding about 
domestic matters at home and about business matters at work. In this exam- 
ple, the portable data processor might remind about the days off when it de- 
tects that the user is at home and the stipulated date 31 May 2002 stored is 
10 approaching. Naturally the apparatus does not have to be able to observe the 
changes in the environment, and the logic of the reminders may as well be 
based on time or other corresponding data as shown above. 

[0037] An alternative classification for an e-mail message could be 
as follows, using a slightly different classification algorithm: 
15 [0038] - type = e-mail message, 

[0039] - sender = John Doe, 

[0040] - received = 15 May 2002, 

[0041] - priority = normal, 

[0042] - stipulated date = 31 May 2002 (selected by the user) 
20 [0043] - first group = work, and 

[0044] - second group = home (selected by the user). 
[0045] What is important is to note that the classification may sug- 
gest an association, such as a stipulated date, and the addition of a previously 
defined group (i.e. for instance that the e-mail message belongs to both groups 
25 "work" and "home"). 

[0046] Another extreme classification example is a classification 
carried out purely based on text type (in contrast to the structural classification 
described above), whereby all different keywords are of similar value, i.e. 
"John Doe", "15 May 2002", "summer holidays", "proposal" and "31 May 2002". 
30 [0047] The classification itself, i.e. retrieving the classification crite- 

ria and classification data, can be carried out in accordance with the prior art. 
The classification algorithm may vary. For instance, technologies derived from 
a self-organizing map (SOM), which is a neuronetwork invented by the Finnish 
professor Teuvo Kohonen, can be applied to implement the classification algo- 
35 rithm. The SOM is described in publication Teuvo Kohonen: Self-organizing 
Maps, third extended edition 2001, ISBN 3-540-67921-9. The SOM or the ex- 
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tensions thereof, such as WEBSOM, have been used for example for classify- 
ing XML documents (Extensible Markup Language). However, what is con- 
cerned here regarding the classification, is an alternative to how the classifica- 
tion algorithm can be carried out. The technologies further derived from SOM 
5 are only a branch in an extensive algorithm tree. The use of WEBSOM is de- 
scribed in publication Honkela, T., Kaski, S., Lagus, K & Kohonen, T. (1996). 
Newsgroup Exploration with WEBSOM method and browsing. Technical Re- 
port A 32, Helsinki University of Technology, Laboratory of Computer and In- 
formation Science, Espoo, Finland. 

10 [0048] For retrieving classification data, "keyword spotting" can for 

example be used, where the idea is to automatically find possibly predeter- 
mined and best describing key words from the text, or "text mining", where 
words describing the text are retrieved, while common words such as "and" 
and "or" are left unnoticed. Also semantic networks can be utilized. The use of 

15 semantic networks allows classifying information. The classification data is pre- 
liminary stored in the material, for instance on the web sites of a World-Wide 
Web (WWW) server. The principle of a semantic network can be utilized for 
retrieving various key data from the material, since metadata is added to the 
information in a semantic network that informs the reader for example about 

20 the nature of the information. The classification algorithm may thus find classi- 
fication criteria in accordance with the principles of a semantic network. 

[0049] Several different, also multi-dimensional, automatically func- 
tioning classifiers can be used to implement a classification algorithm: how- 
ever, in this semi-automatic way, the user may affect the classification him- 

25 self/herself, in which case the final result is useful and the described classifica- 
tion can be used for more heterogeneous data than typical automatic classifi- 
ers. No sides are taken regarding the classification algorithm, and the algo- 
rithm may for instance be self-learning, although a user is typically him- 
self/herself capable of perceiving interdependencies more effectively than a 

30 mere automatic algorithm. 

[0050] In the following, with reference to the block diagram in Figure 
3, a method is described for classifying information in a portable data proces- 
sor. The process starts in 300. Then in 302, information is processed on the 
basis of commands obtained from the user interfaces of a portable data proc- 

35 essor. It is presumed that the information to be processed is stored in a data 
store 314, from which the data can be retrieved for processing 302. The infor- 
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mation in the data store may be of different types, such as: a file, an e-mail 
message, a web site, a text message, a multi-media message, a calendar 
message, a task message, another set of data presented using alphabetic 
and/or numeric characters, or binary data. When the information is a file, asso- 
5 ciation 312 can be carried out when opening, storing or closing the file, and a 
presentation 306 can be carried out when storing or closing the file. When the 
information is an e-mail message, the association 312 can be carried out when 
opening the e-mail message for reading, and the presentation 306 can be car- 
ried out when closing the e-mail message or when moving to the following e- 

10 mail message. When the information is a web site, the association 312 can be 
carried out when browsing at the web site, and the presentation 306 can be 
carried out when exiting the web site, or when closing the browser used for 
browsing, or later on when the process is offline. 

[0051] Then in 312 the information is multi-dimensionally associated 

15 into at least two different categories according to information type and at least 
one other criterion. The criteria comprise at least one of the following: title of 
information, contents of information, context information associated with infor- 
mation, location information associated with information, links associated with 
information, meta data of information, caller group division of subscriber termi- 

20 nals in a radio system. 

[0052] In this example, information processing is continued in 304. 
Such a division of information processing into 302 and 304 mainly describes 
how processing in 302 and 304 as well as association in 312 can be carried 
out in parallel or in turn as is shown in Figure 3. Processing can also be carried 

25 out in various stages, for instance an e-mail message can be opened in 302 
and in 304 the process moves to the following e-mail message. 

[0053] Then in 306, the associations are presented in the user inter- 
face and in 308 the processing related to the associations is carried out based 
on the commands obtained from the user interface. In an embodiment the 

30 processing related to the associations comprises at least one of the following: 
accepting an association, rejecting an association, changing an association. In 
an embodiment, in connection with the processing related to the associations, 
processing related to categories is also carried out, for instance in such a 
manner that the processing related to categories comprises at least one of the 

35 following: deleting a category, changing the properties of a category, creating a 
new category and associating information into the created category. 
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[0054] Finally in 310, the associations are stored for subsequent 
use in the data store 316. The method ends in 318. Even though the data 
stores 314 and 316 are described as separate reserves, they are able to be 
combined in certain conditions. 
5 [0055] In an embodiment, the method further comprises: a portable 

data processor reminds the user in the user interface about the stored associa- 
tion. This can be implemented in various ways. For example in such a manner 
that the method also comprises: a portable data processor senses the opera- 
tional environment thereof and carries out the reminder in the user interface 

10 concerning the stored association associated with the operational environment 
thereof. Or, in such a manner that the method further comprises: a portable 
data processor determines the present instant of time and for carrying out a 
reminder in the user interface concerning the stored association associated 
with the determined instant of time. The portable data processor may deter- 

15 mine the state of the subscriber terminal in the radio system, and send the re- 
minder in the user interface, if it suits the determined state. The portable data 
processor may also carry out the reminder in the user interface concerning the 
stored association associated with the determined state. The portable data 
processor may determine the context information associated with the location 

20 of the subscriber terminal in the radio system, and carry out a reminder in the 
user interface concerning the stored association associated with the deter- 
mined location. 

[0056] A portable data processor as described above is applicable 
to carry out the method, but other kinds of apparatuses can also be employed 
25 for carrying out the method, in which information is processed and which com- 
prise a user interface and utilize as described the classification of the informa- 
tion. 

[0057] Even though the invention has above been explained with 
reference to the example in the accompanying drawings, it is obvious that the 
30 invention is not restricted thereto but can be modified in various ways within 
the scope of the inventive idea disclosed in the attached claims. 



